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Abstract 

Most institutions of higher education require evidence of effective teaching as part of the 
review process for reappointment and for tenure/promotion. This study examines the 
relative importance of various documents placed in the teaching section of the dossier as 
rated by pre-tenured faculty members and review committee members. Results indicate 
that these documents are viewed quite differently by the two groups. There is a danger 
that the differences in the value that various documents play may affect the evaluation of 
teaching effectiveness and the final outcome of reappointment and tenure/promotion de¬ 
cisions. These differences weaken the evaluation process. There must be a common un¬ 
derstanding of the importance of particular documents and how those pieces of evidence 
assist in telling the story of teaching effectiveness. 

Keywords: Teaching Dossiers, teaching portfolios, teaching evaluation, teaching 
effectiveness. 


One of the most important and controversial issues during the last 20 years has been the 
re-evaluation of how faculty performance should be measured and assessed. While tradi¬ 
tional methods of faculty assessment can easily quantify the number of student instruc¬ 
tional hours offered, activities assigned, and exams given, traditional methods do little to 
capture evidence of learning effectiveness. 

Evaluating teaching effectiveness is a task performed by nearly all faculty in almost 
every institution of higher education. Yet, the evaluation of teaching is controversial, 
particularly as it relates to the core elements of teaching effectiveness. Ory (2000) exam¬ 
ined the evolution of teaching evaluations from the 1970s (aimed at developmental pur¬ 
poses including faculty improvement) through the 1980s and 1990s (driven by adminis¬ 
trative needs related to budget restrictions) through the 2000s (driven by demands for ac¬ 
countability, a renewed interest in teaching improvement, and the search for more valid 
metrics of teaching effectiveness). 

Shao, Anderson, and Newsome (2007) acknowledge the considerable debate concerning 
the evaluation of teaching effectiveness. They found that there is a significant difference 
between what respondents feel is currently being used and what should be used to evalu- 
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ate teaching effectiveness. With regard to teaching effectiveness, respondents tend to be¬ 
lieve that currency in field, peers evaluations, classroom visits, and professor’s prepara¬ 
tion should be given more weight, while teaching awards and use of technology should 
not be given as much weight as they currently are. Respondents differed in their opinions 
depending on whether they were from doctoral, masters, or baccalaureate schools; on 
whether they were faculty, department chairs or deans; and on years of experience they 
had in higher education. Some research has found a high degree of similarity between 
what instructors and students considered important to effective teaching (Miller et ah, 
2001; Keller, Mattie, Vodanovich, & Piotrowski 1991). 

One tool to help evaluate teaching effectiveness is the teaching dossier or portfolio. 
Broadly speaking, the dossier is a document that offers evidence of teaching/leaming ef¬ 
fectiveness such as student ratings and classroom observation reports in addition to a 
teaching philosophy, evidence of instructional innovations, assessment efforts, and teach¬ 
ing-related activities such as mentoring, classroom research, and related work. It is a 
structured collection of teacher and student work created over time that is framed by re¬ 
flection and enriched through collaboration (Wolf & Dietz, 1998). Teaching dossiers can 
have a range of applications useful to teachers in college and university environments 
(Ouellett, 2007). A dossier is flexible; it might be useful as a developmental tool (forma¬ 
tive purpose) to stimulate individual reflection and personal development, or it can be 
used as an evaluative tool (summative purpose) to offer support for tenure and promotion 
decisions, to supplement job application infonnation, and to offer evidence of teaching 
effectiveness for teaching awards. 

A potential barrier with evaluating teaching, however, is that evaluations can simultane¬ 
ously serve formative and summative roles (Casey, Gentile & Bigger, 1997; Mills and 
Hyle, 1999; Pratt, 1997; Smith and Tillema, 2007). Seldin (2004) and Bernstein (1996) 
believe that teaching portfolios have the capacity to be used for assessment of teaching 
quality as well as development of enhanced teaching performance. Cavanagh (1996) ar¬ 
gued to keep the two purposes separate and to conflate them, where the peer is also the 
evaluator, may create situations where faculty may not want to risk having frank discus¬ 
sions of weaknesses necessary to improve teaching. Tigelaar, Dolmans, Wolfhagen and 
van der Vleuten (2005), in their study of teaching portfolio assessment, suggest that an 
integration of formative and summative is possible, but acknowledged the considerable 
effort involved in achieving each end. De Rijdt, Tiquet, Dochy, and Devolder (2006) 
found fewer than a quarter of academics producing portfolios, partially due to the am¬ 
bivalence surrounding the multiple functions. Conrad and Bowie (2006) found that the 
context in which portfolios are read, summative or fonnative, yield distinct outcomes. 

Clearly, there is a tension between formative and summative purposes. Knapper and 
Wright (2001) claim that the differences between summative and formative portfolios are 
“not as great as might be expected” (p.25). On the other hand, Chism (2007) notes that 
the distinction between formative and summative reviews is often difficult to sustain in 
practice. Pan et al. (2009) note that formative purposes require precise and specific feed¬ 
back as a guide to improving teaching perfonnance; summative purposes need an overall 
parsimony that facilitates the evaluation process. Buckridge (2008) questioned whether 
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portfolios can have mixed formative and summative purposes and still meet the needs of 
institutions. She notes that while there has been some success to fulfill both purposes, 
such success can diminish the currency of teaching and replace deeper transformative op¬ 
portunities with ore expedient concerns of faculty promotion. 

Additionally, some experts argue that the definition and measurement of effective teach¬ 
ing are ambiguous and subjective (Dilts, 1980) and that the evaluation of teaching is un¬ 
systematic (Seldin, 1984). On the other hand, there are those who believe that teaching 
can be effectively defined and measured (Cashin, 1988). Chism (1999) found that there 
is a great deal of consensus on what characterizes effective teaching. Among those fac¬ 
tors that are consistently mentioned are subject matter competence, preparation and or¬ 
ganization, enthusiasm, and interpersonal rapport. Fink (2008) identified four fundamen¬ 
tal tasks of teaching: knowledge of subject matter, designing learning experiences, inter¬ 
acting with students, and course management. He argued that if the teacher does all four 
well, students will have a good learning experience. To the degree that the teacher does 
one or more poorly, the quality of the learning experience declines (p. 39). Arreola 
(2007) identified five broad skill dimensions required for competent teachers: content 
expertise, instructional design skills, instructional delivery skills, instructional assessment 
skills, and course management skills. Hatch (2006) described teaching as “a complex 
intellectual endeavor that demands disciplinary expertise, a deep understanding of stu¬ 
dents, and sophisticated pedagogical skills.” 

Many faculty members believe that it is far easier to evaluate the quality of research than 
of teaching due to its explicit and tangible outcome, generally by number and value of 
publications. Furthermore, publications can be reviewed by outside experts in a particu¬ 
lar field. The effects of teaching are not as easily quantifiable, and rarely is the act of 
teaching or its products reviewed by experts outside one’s institution. 

At most institutions, tenure and/or promotion decisions are based on the evaluation of a 
faculty member’s teaching, research, and service. As difficult as some believe it is to 
evaluate, teaching is an essential part of a faculty member’s promotion and tenure dos¬ 
sier. Therefore, faculty members are required to include a section that describes the fac¬ 
ulty member’s teaching activities and includes indicators of teaching effectiveness. 

In order to present a comprehensive picture of teaching, a faculty member includes 
documents and artifacts that represent products of good teaching. Often included are ad¬ 
ministrator’s annual reviews, course syllabi, teaching materials, teacher-developed tests, 
journals or diaries, videotapes of teaching, peer observation reports, samples of student 
work, letters of support, and student course evaluations. Statements of educational phi¬ 
losophy and narratives that help to interpret the artifacts may be included in the dossier 
and may provide evidence of a faculty member’s teaching effectiveness. According to 
Edgerton, Hutchings, and Quinlan (1991) the ideal dossier should highlight a professor’s 
reflections about a sample of actual work explaining his or her instructional decisions. 
The dossier should also include documentation of any classroom research that was car¬ 
ried out and an explanation of changes made as a result of the investigation. In addition 
to the dossier itself, it is not unusual to present a box filled with supporting materials to 
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the tenure and promotion committee. Cannon (2001) proposed that the three essential 
components of a teaching portfolio are the personal statement of teaching, an overview of 
teaching accomplishments and activities, and verification of the success of the activities 
through feedback from colleagues (classroom observation reports) and students (student 
evaluations). 

Although considerable research exists concerning the relative importance of teaching 
portfolios, much less work exists on the assessment of portfolios for summative purposes 
(Burroughs, 2001; Smith & Tillema, 2003, 2007). Reviewers make important decisions 
about the effectiveness and contributions of faculty members by evaluating the docu¬ 
ments the faculty member has placed in the dossier. However, research has supported 
that those implementing teaching portfolios have different views from those experiencing 
their direct and indirect impacts (Leggett & Bunker, 2006; Taylor, 2001; Guest & Duhs, 
2003; Stephenson, 2004). At issue here is the match between the relative importance the 
faculty member puts on individual documents placed in the teaching section of the dos¬ 
sier and the importance the review committee and administrators place on these same 
elements. Naturally, a faculty member would emphasize a particular element in some 
way in order to present the best case. A mismatch by reviewers and the faculty member 
involving the perceived importance of particular elements in the teaching dossier could 
possibly cause a faculty member to misrepresent his or her teaching to the degree that 
tenure and/or promotion might be jeopardized. 

This paper is part of a larger study in which various aspects of teaching across the univer¬ 
sity were examined. The purpose of this paper is to make a comparison between pre¬ 
tenured faculty members and tenured reviewers regarding the relative importance as¬ 
cribed to various documents placed in the teaching section of the reappointment or ten¬ 
ure/promotion dossier. 


Method 

The research was completed at a southeastern urban research university of about 17,000 
students, 700 faculty members, and six colleges, with the proposal of a seventh college 
recently accepted. The first doctoral programs were begun in 1996. At the time of the 
study, six doctoral programs were in existence and three more programs had been granted 
pennission to plan. (The university has continued to expand in the interim with more 
students, more faculty, and more doctoral programs.) Since the establishment of doctoral 
programs, the institution has been moving quickly toward a research agenda. While re¬ 
search expectations for tenure and promotion have increased, the university maintains its 
focus on the importance of teaching. Several examples illustrate this point. Among them 
are two university level awards: Each fall, the Bank of America Award for Teaching Ex¬ 
cellence is presented at the end of a week of activities that highlight teaching. The Pro¬ 
vost’s Award for Excellence in Teaching is awarded annually to a department that has 
demonstrated special emphasis on teaching. In addition, the title of Cone Distinguished 
Professor for Teaching has been awarded to a number of faculty members who excel in 
this area. Finally, individual college awards are given each year to faculty members who 
show evidence of distinguished accomplishments in the area of teaching. 
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Procedures 

Instrumentation . Respondents were asked to consider the usefulness of fifteen docu¬ 
ments or artifacts that are usually included in the teaching section of a faculty member’s 
dossier for reappointment or for promotion and tenure (see Table 1 for the specific docu¬ 
ments). They were asked to rate each item in terms of its usefulness in evaluating the 
faculty member’s teaching. The following scale was used: very useful, somewhat useful, 
minimally useful, not useful. We note that a neutral response was not included in order 
to force the respondent to decide whether they lean more towards one end of the scale or 
the other. A fifth choice, not applicable (NA), was also included. 

Subjects 

Two different groups were the focus of this study: pre-tenured faculty members and fac¬ 
ulty members who serve as reviewers for reappointment and tenure/promotion cases. 
Questionnaires were distributed to 343 tenured faculty members, including administra¬ 
tors, in order to capture all who might have been in the position of reviewing faculty for 
reappointment or tenure and promotion. The questionnaires were delivered to department 
review committee members and college review committee members in the middle of the 
fall semester to coincide with the completion of their peer reviews. Questionnaires were 
distributed to department chairs and college deans at the end of the same semester to co¬ 
incide with the timeline for their review. Responding to the questionnaires as soon as the 
round of reviews was completed was considered imperative in giving results that re¬ 
flected how reviewers used specific documents in the teaching portfolio. Questionnaires 
were mailed to 127 faculty members who had just been reviewed an all pre-tenured fac¬ 
ulty members. New hires were not included due to the fact that they were less familiar 
with the tenure process at the time of the survey. These numbers represent the entire 
population involved in the promotion and tenure process at the university. 

Results 

One hundred fifty tenured faculty members responded to the survey. Although surveys 
were sent to all tenured faculty members and administrators, only those who served on a 
reappointment or tenure committee during the semester of the study were asked to com¬ 
ment on the relevance of teaching portfolio documents. Eighty of the one hundred fifty 
respondents fell into this category, and were able to evaluate the relevance of portfolio 
documents. Eighty pre-tenured faculty members returned the questionnaire and 53 (those 
whose dossiers were under review) rated the importance of portfolio documents. 

Results were averaged using the following scale: very useful = 3, somewhat useful =2, 
minimally useful =1, not useful = 0. Missing data or responses that indicated an item was 
not applicable were not included in the averages. A summary of results is found in table 
1. Portfolio items are listed in column one. The data in columns two and three corre¬ 
spond to the two groups (tenured reviewers or pre-tenured faculty) of respondents. Items 
are ordered in descending importance as judged by the tenured reviewers. 
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TABLE 1. Comparison of the Relative Importance of Various Components of a Teaching 
Dossier. 


Dossier Item 

Reviewers 

Item Mean 

Pre-tenured Faculty 

Item Mean 

Contributions to teaching within the 
university 

2.43 (rank = 1) 

2.16 (rank = 6) 

Course and curriculum development 

2.42 (rank = 2) 

2.11 (rank = 7) 

Teaching awards 

2.41 (rank = 3) 

2.03 (rank = 9-10) 

Student evaluations 

2.40 (rank = 4) 

1.70 (rank =15) 

Contributions to teaching nationally or 
internationally 

2.38 (rank = 5) 

2.10 (rank =8) 

Faculty narrative 

2.34 (rank = 6) 

2.24 (rank = 3) 

Department chair’s annual review 

2.32 (rank = 7) 

2.34 (rank=l) 

Peer observation reports 

2.32 (rank = 8) 

1.87 (rank =12-13) 

Description of teaching activities 

1.92 (rank = 9) 

2.27 (rank = 2) 

Letters from outside reviewers 

1.85 (rank =10) 

1.89 (rank =11) 

Course syllabi 

1.81 (rank =11) 

2.18 (rank =5) 

Letters from faculty within the univer¬ 
sity 

1.59 (rank =12) 

2.03 (rank = 9-10) 

Student work/artifacts 

1.57 (rank =13) 

2.22 (rank = 4) 

Course exams/tests/quizzes 

1.44 (rank =14) 

1.86 (rank =14) 

Letters from students 

1.36 (rank =15) 

1.87 (rank =12-13) 


Notes: 

3 = very useful 
2 = somewhat useful 
1 = minimally useful 
0 = not useful 

Rank 1 = highest ranking in terms of importance (most important) 

Rank 15 = lowest ranking in terms of importance (least important) 

It is apparent that the means in column two (tenured reviewers) fall into two distinct 
groups: There were eight items with means greater than or equal to 2.32 (group one) and 
seven items with means less than or equal to 1.92 (group two). Dependent /-tests were 
done comparing review committee responses to pairs of items in group one. At a 95% 
confidence level, there were no significant statistical differences within this group. How¬ 
ever, the same type of tests showed that any pair that included an item from group one 
and an item from group two, showed statistically significant differences. This two-tiered 
rating was not apparent among pre-tenured faculty. Indeed, although the overall means 
of the fifteen items were roughly the same for both groups (overall mean for column 2 is 
2.04, while the overall mean for column 3 is 2.06), the pre-tenured faculty members rat¬ 
ings were generally closer to the mean—only ranging from 1.70 to 2.34. Dependent t- 
tests do, however, show that pre-tenured faculty had a statistically significant preference 
for the chair’s annual review, a description of teaching activities, faculty narrative, and 
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student work over peer observation reports, letters from students, course exams, and stu¬ 
dent evaluations. 

Both tenured reviewers and pre-tenured faculty are in general agreement about the rela¬ 
tive importance of several portfolio items: letters from outside reviewers (on teaching), 
letters from students, and sample course exams are relatively less important, while the 
faculty narrative and the chair’s annual review are relatively important. There are, how¬ 
ever, striking differences on some items: Tenured reviewers rate teaching awards, stu¬ 
dent evaluations, and peer observation reports highly (3 rd , 4 th and 8 th —all in the top tier), 
while pre-tenured faculty members rate these items 9 th , 15 th , and 12 th . Reversing this dif¬ 
ference, tenured reviewers ranked the description of teaching activities, course syllabi, 
and student work/artifacts as second tier items while pre-tenured faculty ranked them 2 nd , 
5 th , and 4 th on their list. Table 2 illustrates these comparisons while taking into account 
the fact that reviewers and pre-tenured faculty have different levels of variation among 
items. This table was constructed by standardizing the entries in each column of table 1. 
A positive score in Table 2, column 2 indicates that tenured reviewers rated this item 
above their mean response of 2.04 while a negative value indicates that this item rated 
below the mean. The values measure the number of standard deviations above or below 
the mean. Column 3 of Table 2 was created for pre-tenured faculty in the same way. 
The largest differences in column 4 are indications of divergent opinions (in the relative 
importance of an item) between the two groups (tenured reviewers vs. pre-tenured fac¬ 
ulty). 

In this article, we generally focus on the relative value ascribed to various dossier items. 
However, for those who wish to directly compare the average response of the two groups 
for a given item, please refer to Table 3. This table was generated using a 2-tailed /-test 
for 2 samples with unequal variance, //-values that are less than 0.05 indicate statistically 
significant differences. This analysis does not address the relative importance of a given 
item, but instead gives a measure of the probability that the responses for reviewers and 
pre-tenured faculty members come from populations with the same mean (for that dossier 
item). The smaller the //-value, the less likely that the two populations have the same 
mean. Even items with strong relative agreement such as course exams or letters from 
students can show significant differences in means. 

Discussion and Conclusions 

The data reported above indicates that tenured reviewers and pre-tenured faculty mem¬ 
bers do not rate all portfolio items in a similar manner. It is interesting that tenured re¬ 
viewers clearly separate portfolio items into two distinct tiers: one with significant value 
to their review process and the other with much less value. Although pre-tenured faculty 
had a clear (and statistically significant) preference for chair’s annual review, a descrip¬ 
tion of teaching activities, faculty narrative, and student work when compared with peer 
observation reports, letters from students, course exams, and student evaluations, they 
were, in general, much more egalitarian in their view of the importance of various dossier 
items. 
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TABLE 2. Comparison of Z-scores of Various Components of a Teaching Dossier. 


Dossier Item 

Reviewers 

Z-score 

Pre-tenured Faculty 

Z-score 

Difference 

Contributions to teaching within 
the university 

0.974 

0.550 

0.424 

Course and curriculum devel¬ 
opment 

0.949 

0.280 

0.669 

Teaching awards 

0.925 

-0.151 

1.076 

Student evaluations 

0.900 

-1.93 

2.83 

Contributions to teaching na¬ 
tionally or internationally 

0.850 

0.226 

0.624 

Faculty narrative 

0.751 

0.981 

-0.230 

Department chair’s annual re¬ 
view 

0.701 

1.52 

-0.819 

Peer observation reports 

0.701 

-1.01 

1.715 

Description of teaching activi¬ 
ties 

-0.291 

1.14 

-1.434 

Letters from outside reviewers 

-0.465 

-0.906 

0.441 

Course syllabi 

-0.564 

0.658 

-1.222 

Letters from faculty within the 
university 

-1.11 

-0.151 

0.959 

Student work/artifacts 

-1.16 

0.874 

-2.033 

Course exams/tests/quizzes 

-1.48 

-1.07 

-0.414 

Letters from students 

-1.68 

-1.01 

-0.667 


Notes: 

Mean for reviewers = 2.04 Mean for pre-tenured faculty = 2.06 

Standard deviation = 0.403 Standard deviation = 0.185 

Zscore = (item -mean)/(standard deviation) 


In view of the high stakes nature of the review process, it is worrisome that some dossier 
documents are viewed quite differently by reviewers as compared to those faculty mem¬ 
bers being reviewed. Review committees tend to highly rate the items that have long 
been regarded as important measures of teaching quality/effectiveness. These items in¬ 
clude course and curriculum development, teaching awards, and student evaluations. We 
note that documentation of curriculum development and teaching awards are more in the 
nature of the objective evidence that is common when evaluating research. The faculty 
narrative and peer observation reports are newer requirements and perhaps more subjec¬ 
tive, but highly valued. The authors of this article were surprised that student 
work/artifacts were not more highly regarded by review committees. We speculate that 
the perceived value may increase as committees become more accustomed to evaluating 
this material. 
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TABLE 3. /^-values for a 2-tailed /-test for 2 samples with unequal variance. 


Dossier Item 

//-value 

Contributions to teaching within the university 

0.0639 

Course and curriculum development 

0.0159 

Teaching awards 

0.0210 

Student evaluations 

0.0000104 

Contributions to teaching nationally or internationally 

0.104 

Faculty narrative 

0.500 

Department chair’s annual review 

0.914 

Peer observation reports 

0.00209 

Description of teaching activities 

0.00670 

Letters from outside reviewers 

0.959 

Course syllabi 

0.0097 

Letters from faculty within the university 

0.0135 

Student work/artifacts 

0.0000967 

Course exams/tests/quizzes 

0.0275 

Letters from students 

0.00576 


Notes: For each of the fifteen items, this table compares the responses of review committee 
members to those of pre-tenured faculty. A //-value less than 0.05 means that the difference 
in responses of these two groups was statistically significant. 


Pre-tenured faculty downplayed the importance of student evaluations, teaching awards, 
and peer observation reports. The chair’s annual review, description of teaching activi¬ 
ties, faculty narrative, student work/artifacts, and course syllabi were deemed most im¬ 
portant. Many of these items give the faculty member a chance to discuss teaching phi¬ 
losophy, goals, and priorities. These items allow pre-tenured faculty a chance to inter¬ 
pret rather than simply report on classroom activities. 

While the chair’s annual review is a first tier (highly valued) item, review committees do 
not rate it as highly (relative to other items in the dossier) as pre-tenured faculty mem¬ 
bers. Presumably, this reflects the review committee’s desire to form its own judgment 
using independent evidence. Obviously, more evidence is generally available at promo¬ 
tion/tenure time than during a typical annual review. In contrast, the chair’s review may 
provide the only significant feedback available to pre-tenured faculty. It is reasonable 
that their self-assessment is strongly influenced by this evidence. 

Peer observation has become an integral part of the evaluation of untenured faculty 
members at our institution. In 1994, the state assembly required classroom observations 
of untenured faculty to be part of the review process. Throughout the university, obser¬ 
vation guidelines require both pre- and post-observation meetings. The pre-observation 
meeting allows the untenured faculty member to set the observed class in context and to 
discuss instructional goals for that class. The post-observation meeting allows the ob¬ 
server to give feedback as well as allowing the untenured faculty member to comment on 
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any actions that they may have taken during the observed class. Student evaluations are 
also required in each class taught during the academic year. It is surprising that pre¬ 
tenured faculty members have a relatively low regard for both student and peer observa¬ 
tions. These are the only direct evidence of classroom technique that is available to the 
review committee. 

One may try to account for the relatively low value that pre-tenured faculty place on peer 
observation by supposing that the process was particularly stressful, or noting that since 
very few classes are actually observed, the observed classes may not be representative. 
Finally, one could conjecture that if peer observation reports tend to include only positive 
comments, they may not be seen as particularly useful. None of these reasons seem to be 
valid. In a related article, Kohut, Yon, & Bumap (2007) examined perceptions of ob¬ 
servers and those observed during the peer review process. Data was gathered from the 
same group of pre-tenured faculty currently under discussion. When these pre-tenured 
faculty were asked whether the observed classes were representative of their teaching, 
54.2% strongly agreed, 31.9 % agreed, 11.1% were neutral, 2.8% disagreed, and 0% 
strongly disagreed. When responding to the statement “Having my classes observed was 
stressful,” 11.3% strongly agreed, 14.1% agreed, 21.1% were neutral, 32.4% disagreed, 
and 21.1% strongly disagreed. When responding to the statement “Peer observers tend to 
include only positive comments in their reports,” 5.7% strongly agreed, 20% agreed, 
34.3% were neutral, 28.6% disagreed, and 11.4% strongly disagreed. Responses to this 
survey also indicated that pre-tenured faculty rated peer observations as more useful, 
valid, and reliable than did those doing the observations. None of this seems to indicate 
that pre-tenured faculty would place such a low value on these reports. Perhaps the fact 
that peer observations may have formative as well as summative value accounts for this 
seeming inconsistency. The formative nature is more apparent in the post-observation 
discussion, while during the promotion/tenure review, peer observation reports are 
merely summative. 

The evaluation of teaching will remain controversial until there is a consensus on what 
constitutes evidence of effective teaching. Agreement on the core issues of teaching 
rated in a portfolio would be helpful (Cochran-Smith & Fries, 2002; Delandshere & 
Arens, 2001, 2003) but such an agreement has yet to be adopted in the teaching profes¬ 
sion (Babin, Shaffer, & Morgan, 2002; Murphy & MacLaren, 2007; Murphy, MacLaren, 
& Flynn, 2009; Zuzowsky & Libman, 2002). Work by Sorcinelli (2006, 1993) and 
Chickering and Gamson (1987) have helped in this regard. Sorcinelli suggested that 
good teaching has five dimensions: dynamism/enthusiasm, command of the subject, or¬ 
ganization/clarity, faculty-group interaction, and faculty-student contact. However, these 
exists little agreement in this set of characteristics although Chickering and Gamson 
suggest a set of behaviors that help guide instructors interested in improving their teach¬ 
ing such as giving prompt feedback, communicating high expectations, and respecting 
diverse talents and ways of learning. 

The responses to this survey indicate that there are some significant points of disagree¬ 
ment. This undermines the evaluation process. If pre-tenured faculty and reviewers 
making high-stakes decisions see the importance of portfolio documents differently, a 
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faculty member’s choices and presentation of this material may adversely affect the pro¬ 
motion/tenure decision. We hope that this article will provide insights into the nature of 
these differences, and a basis for a collegial discussion of these issues. 

Teaching must be recognized within institutions for teaching dossiers to become more 
widely adopted in higher education. Central to this is having faculty and administrators 
adopt similar views of the purpose of portfolios and the relative importance of portfolio 
content. As Arreola (2007) observed, only when the elements of a faculty evaluation 
program (summative purpose) are carefully integrated into a professional enrichment 
program (fonnative purpose) does the institution obtain the greatest benefit. 
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